SCHEME FOR SYSTEMATICALLY REGISTERING META-DATA 
WITH RESPECT TO VARIOUS TYPES OF DATA 

5 BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

The present invention relates to a meta-data 
registration scheme to be utilized in a data server device. 
10 In the following description; a word "meta-data" stands for 
any data on a registered data (or data to be registered) 
which can be utilized in handling or managing the 
registered data. 

15 DESCRIPTION OF THE BACKGROUND ART 

In a system for providing information to users through 
information terminals and information processing by 
computers, a data server device has been used as a device 
for registering and storing data, from which a specified 

20 data can be retrieved or data satisfying a desired 

condition can be searched and retrieved. The data server 
device can be implemented in a form of a file system or a 
database management system which is a part of an operating 
system of a computer, or a Web server, file server or 

25 database server with respect to which data can be entered 
or retrieved through a network. 

A Web server which is one variant of the data server 
device manages various data on a name space having tree- 
like hierarchical structure as shown in Fig. 1. In this 

30 example, "/pro j/doc/f ig/imagel . gif " and 

"/proj/doc/f ig/image2 . gif " are image data in GIF format, 
"/proj/doc/spec . doc" is a document produced by a word 
processor (MS-WORD) , "/proj/src/program. c" is a source code 
of a program written in the C language, and 

35 "/sample/index. html" is a document file written in HTML. 
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Nodes that classify data such as nodes specified by "/proj" 
or "/pro j/doc/f ig" are generally referred to as 
directories. In each directory, arbitrary many data or 
directories can be included. A directory "/" that is at a 
5 top level in Fig. 1 will be referred to as a root 
directory. 

Data managed by the Web server can be accessed by the 
HTTP protocol through a network. The HTTP protocol is 
disclosed in "Hypertext Transfer Protocol --HTTP/1.0" (RFC 
10 1945) and "Hypertext . Transfer Protocol --HTTP/1.1" (RFC 
2068). When some client wishes to obtain data of 
"/proj/doc/f ig/imagel . gif " , for instance, this client sends 
a GET request having a header in a format of "GET 
'ft /proj/doc/f ig/imagel .gif HTTP/1.1" to this Web server. Upon 

In 15 receiving this request, the Web server returns the 
l:^ requested data. 

! s Pf- In order to register data on the Web server, a PUT 

request of the same HTTP protocol is used. When some client 
;! wishes to register a new image data in the GIF format under 

^ 20 the name of "/pro j /doc/f ig/image3 . gif " , for instance, this 
| s [J client sends a PUT request having a header in a format of 

fy "PUT /proj/doc/f ig/image3. gif HTTP/1.1" to this Web server, 

j. 3- 

£3 while containing the image data to be registered in a body 

of this PUT request. Upon receiving this request, the Web 
25 server stores the received image data under the specified 
name . 

In the HTTP protocol , a type of data to be exchanged 
by GET or PUT in terms of a MIME type. For instance, a type 
of data to be transferred is specified by a header in a 
30 form of "Content- type : image/gif". 

A UNIX file system which is another example of the 
data server device manages various data (called files in 
the case of the file system) on a name space having a tree- 
like hierarchical structure as shown in Fig. 2. In the 
35 example of Fig. 2, "proc.c" and "sys.h" are referred to as 
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files, while "src" and "include" are referred to as 
directories. At a time of accessing a file in the file 
system, a file name of a file to be accessed is specified 
by a file name having a hierarchical structure such as 
5 "/src/kernel/proc . c" . 

It is well known that data managed by the data server 
device can be handled more conveniently by attaching meta- 
data to each data. There are various examples of the meta- 
data including creation date and location, a creator, an 

10 access control information, an update log, a data source 
(information indicating where this data comes from), a 
channel, an EPG (electronic program guide), a thumb nail in 
the case of image data, an icon to be displayed by GUI, a 
position to be displayed on a display, an application that 

15 created that data, an application for browsing that data, a 
MIME type of data, a size of data, and keywords to be used 
in the retrieval. 

Meta-data is also referred to as property or attribute 
in some contexts. 

20 When data are managed by attaching meta-data, it 

becomes possible not only access data by specifying its 
name, but also retrieve the data by specifying a value of 
its meta-data, or manage information necessary for 
applications in relation to the data. 

25 A scheme for registering meta-data in the Web server 

is disclosed as a part of the specification called WebDAV 
in "HTTP Extensions for Distributed Authoring — WEBDAV" 
(Internet Draft: draf t-ietf -webdav-protocol-10 ) . In the 
WebDAV, it is possible to attach meta-data called property 

30 to data managed by the Web server. 

The Web server compatible with WebDAV manages data as 
shown in Fig. 3. It is basically the same as the Web server 
shown in Fig. 1, but unlike the usual Web server, it 
manages data and meta-data (property) in relation to each 

35 other. The property is maintained by a pair of a name and a 
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value. In the example of Fig. 3, data specified by 
"/doc/f ig/imagel .gif " has three properties of "author", 
"date" and "access". The "author" property has a creator of 
the data as its value, which is "T. Sato" in this case. The 
5 "date" property indicates the date on which the data was 

created and the "access" property indicates an access right 
information in this example. 

In the WebDAV, a directly is referred to as a 
collection. In Fig. 3, intermediate nodes of the tree 
10 specified by "/" or "/doc/fig" are collections. In the 

specification of the WebDAV, it is possible to attache a 
meta-data to the collection as well. In the following 
description, this collection will also be referred to as a 
directory . 

15 In the WebDAV, new request methods are added by 

extending the HTTP protocol in order to access the 
property. More specifically, a PROPFIND method is provided 
in order to extract a value of the property, and a 
PROPPATCH method is provided in order to set up or delete a 

20 value of the property. By issuing a request using these 
methods, it becomes possible to access the property. The 
bodies of PROPFIND or PROPPATCH requests and replies are 
required to be described using XML. 

A scheme for retrieving data managed by attaching the 

25 property according to the specification of the WebDAV is 
called a DASL, which is disclosed in "DAV Searching & 
Locating" ( Internet Draft : draf t-reddy-dasl-protocol- 
04.txt). The DASL adds a SEARCH method for the retrieval by 
expanding the HTTP protocol, such that when a SEARCH 

30 request containing a retrieval condition described by XML 
in its body is sent, a corresponding retrieval result 
written in the same XML will be returned. In the retrieval 
condition, a value of a property of a specific name can be 
specified with respect to a certain property or a 

35 combination of a plurality of properties. 



-4- 




An exemplary case of registering meta-data in the UNIX 
type file system includes a Be file system disclosed in 
Dominic Giampaolo: "Practical File System Design with the 
Be File System", Morgan Kaufmann Publishers, Inc., ISBN 1- 
5 55860-497-9. The Be file system manages files in the name 
space having a tree-like hierarchical structure similar to 
the UNIX, where meta-data called attribute can be attached 
to each file. The attribute is given by a pair of a name 
and a value, where the name is given by a character string, 
10 and the value is given by data such as a character string, 
integer, real number, binary number, etc. 

Similarly as in the DASL , files can be retrieved using 
an attribute value or a combination of attribute values in 
u f the Be file system. 

in 15 As described, in the data server device capable of 

I;- managing meta-data, it is possible to manage various 

Lfl information as meta-data in relation to the data. In 

!j addition, it becomes possible to realize sophisticated 

. s accesses based on the retrieval using various information 

I s * 20 as keys, rather than a simple access using a name of data 
hj or file. Furthermore, the construction of an application 

ry program can be also made easier as the application manages 

i»3 information related to data as meta-data in relation to the 

data. 

25 The data server device for managing meta-data in 

relation to data has various advantageous features as 
described above, but the full advantages of these features 
cannot be taken unless meta-data are actually attached to 
individual data. However, attaching meta-data to individual 

30 data requires enormously tedious tasks. 

There are meta-data such as creation date and size of 
a file which can be attached automatically at a time of 
registering data. However, such meta-data are limited only 
to the generic meta-data that do not depend on data type 

35 and the like. If tasks to attach meta-data can be made 
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automatic somehow, even to some extent, for the other kinds 
of meta-data, it is possible to expect that the user will 
be relieved from tedious tasks. 

Also, at a time of attaching meta-data, if different 
5 names are used for the meta-data having the same meaning 
depending on persons who attach them or times at which they 
are attached, or if the meta-data are attached to the data 
of the same type under arbitrarily different names by 
different applications, it becomes impossible to realize 
10 the effective retrieval later on. Consequently, there has 
been a demand for a way to attach meta-data systematically 
to some extent. 

m 15 SUMMARY OF THE INVENTION 

Lnj It is therefore an object of the present invention to 

provide a scheme for systematically registering meta-data 
with respect to various types of individual data so as to 

;=3 20 enable sophisticated retrieval or application program 

I- Us 

ry construction assistance utilizing the meta-data in the data 

iU server device. 

ft According to one aspect of the present invention there 

is provided a meta-data registration method, comprising the 

25 steps of: detecting a data type of registering data to be 
registered into a data server that registers and manages 
data and meta-data for data; selecting one procedure 
corresponding to the data type detected by the detecting 
step, from a plurality of procedures provided in 

30 correspondence to respective data types and stored in 
advance, each procedure having a program code for 
generating the meta-data for data according to a 
corresponding data type; generating the meta-data for the 
registering data by executing the program code of said one 

35 procedure selected by the selecting step; and registering 
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the meta-data generated by the generating step in relation 
to the registering data into the data server. 

According to another aspect of the present invention 
there is provided a meta-data registration device, 
5 comprising: a detection unit configured to detect a data 
type of registering data to be registered into a data 
server that registers and manages data and meta-data for 
data; a selection unit configured to select one procedure 
corresponding to the data type detected by the detection 

10 unit, from a plurality of procedures provided in 

correspondence to respective data types and stored in 
advance, each procedure having a program code for 
generating the meta-data for data according to a 
corresponding data type; and an output unit configured to 

15 output the meta-data for the registering data generated by 
executing the program code of said one procedure selected 
by the selection unit, to the data server. 

According to another aspect of the present invention 
there is provided a data server device, comprising: a 

20 request processing unit configured to receive and process a 
registration request for registering data to be registered 
into the data server device; a management unit configured 
to register the registering data according to the 
registration request and manage data and meta-data for 

25 data; a detection unit configured to detect a data type of 
the registering data; a selection unit configured to select 
one procedure corresponding to the data type detected by 
the detection unit, from a plurality of procedures provided 
in correspondence to respective data types and stored in 

30 advance, each procedure having a program code for 
generating the meta-data for data according to a 
corresponding data type; a generation unit configured to 
generate the meta-data for the registering data by 
executing the program code of said one procedure selected 

35 by the selection unit; and a registration unit configured 
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to register the meta-data generated by the generation unit 
into the management unit. 

According to another aspect of the present invention 
there is provided a computer usable medium having computer 
5 readable program codes embodied therein for causing a 
computer to function as a meta-data registration device, 
the computer readable program codes include: a first 
computer readable program code for causing said computer to 
detect a data type of registering data to be registered 
10 into a data server that registers and manages data and 

meta-data for data; a second computer readable program code 
for causing said computer to select one procedure 
corresponding to the data type detected by the second 
computer readable program code, from a plurality of 
!,n 15 procedures provided in correspondence to respective data 
Jlft types and stored in advance, each procedure having a 

if! program code for generating the meta-data for data 

'ft according to a corresponding data type; and a third 

* computer readable program code for causing said computer to 

; s= f 20 output the meta-data for the registering data generated by 
f y executing the program code of said one procedure selected 

l J£ by the second computer readable program code, to the data 

i!3 server. 

According to another aspect of the present invention 
25 there is provided a computer usable medium having computer 
readable program codes embodied therein for causing a 
computer to function as a data server device, the computer 
readable program codes include: a first computer readable 
program code for causing said computer to receive and 
30 process a registration request for registering data to be 
registered into the data server device; a second computer 
readable program code for causing said computer to register 
the registering data according to the registration request 
and manage data and meta-data for data; a third computer 
35 readable program code for causing said computer to detect a 
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data type of the registering data; a fourth computer 
readable program code for causing said computer to select 
one procedure corresponding to the data type detected by 
the third computer readable program code, from a plurality 
5 of procedures provided in correspondence to respective data 
types and stored in advance, each procedure having a 
program code for generating the meta-data for data 
according to a corresponding data type; a fifth computer 
readable program code for causing said computer to generate 

10 the meta-data for the registering data by executing the 
program code of said one procedure selected by the fourth 
computer readable program code; and a sixth computer 
readable program code for causing said computer to register 
the meta-data generated by the fifth computer readable 

15 program code into the second computer readable program 
code . 

Other features and advantages of the present invention 
will become apparent from the following description taken 
in conjunction with the accompanying drawings. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a diagram for explaining a conventional data 
25 management method used in a Web server. 

Fig. 2 is a diagram for explaining a conventional data 
management method used in a UNIX file system. 

Fig. 3 is a diagram for explaining a conventional data 
management method used according to WebDAV. 
30 Fig. 4 is is a schematic block diagram showing a 

general configuration of a data server device. 

Fig. 5 is a block diagram showing an exemplary 
configuration of a meta-data registration device according 
to one embodiment of the present invention. 
35 Fig. 6 is a block diagram showing an exemplary 
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configuration of a data server device incorporating the 
meta-data registration device of Fig. 5. 

Fig. 7 is a diagram showing an exemplary meta-data 
generation procedure table used in the meta-data 
5 registration device of Fig. 5. 

Fig. 8 is a flow chart for an exemplary meta-data 
registration processing by the meta-data registration 
device of Fig. 5. 

Fig. 9 is a diagram showing an exemplary meta-data 
10 generated by the meta-data registration processing of Fig. 
8 . 

Fig. 10 is a diagram showing an exemplary XML document 
that can be handled by the meta-data registration device of 
;;| Fig. 5. 

;= r § 15 Fig. 11 is a diagram showing an exemplary XML meta- 

i=fs data correspondence table that can be used in the meta-data 

s»n registration device of Fig. 5. 

iil- 
'S Fig. 12 is a diagram showing an exemplary meta-data 

52 generated from a XML document data by the meta-data 

.3 IS* 

l7s 20 registration device of Fig. 5. 

rU Fig. 13 is a block diagram showing another exemplary 

; 5 ~ configuration of a data server device incorporating a meta- 

O data registration device according to one embodiment of the 

present invention . 
25 Fig. 14 is a block diagram showing still another 

exemplary configuration of a data server device 

incorporating a meta-data registration device according to 

one embodiment of the present invention. 



30 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Referring now to Fig. 4 to Fig. 14, the preferred 
embodiments of a meta-data registration scheme according to 
35 the present invention will be described in detail. 
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In short, in the meta-data registration scheme of the 
present invention, a plurality of meta-data generation 
procedures (programs) for respective data types are stored 
in a computer (a meta-data registration device or a data 
5 server device) in advance, a meta-data registration 

procedure corresponding to a type of data to be registered 
into the data server device is selected from these stored 
meta-data generation procedures, and a meta-data is 
automatically generated using the selected meta-data 
10 generation procedure and registered in relation to the 
registering data. 

Fig. 4 shows a general configuration of a data server 
device 100, which comprises a storage device 103 which is a 
device for actually storing data such as a magnetic disk 

"a sr. 

Ul 15 device, a data management unit 102 for managing which data 
p;1 is stored in which region of the storage device 103, and an 

LI access request processing unit 101 which receives an access 

j s ~ request from an access requesting source which is either an 

application program or a user (Including a terminal device, 
: = 1 20 a client, etc., on a network), commands reading/writing of 
Fjj data necessary for the requested access to the data 

management unit 102, and returns a request processing 
□ result to the access requesting source. Here, the request 

processing result is given by either a status information 
25 (response) indicating whether the data writing has 

succeeded or not in the case of a data writing request (PUT 
containing data, for example), or the read out data or a 
status information indicating that the data reading has 
failed in the case of a data reading request (GET 
30 indicating request, for example). 

Fig. 5 shows an exemplary configuration of a meta-data 
registration device 200 according to one embodiment of the 
present invention, and Fig. 6 shows an exemplary 
configuration of a data server device 300 which 
35 incorporates the meta-data registration device 200 of Fig. 
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5 in addition to the general configuration of a data server 
device shown in Fig. 4. Note that Fig. 6 shows the meta- 
data server 200 only in a schematic configuration for the 
sake of simplicity but it should be understood that the 
5 meta-data server 200 actually has a detailed configuration 
as shown in Fig. 5. 

When a request for registering data (PUT, for example) 
arrives at the data server device 300, the access request 
processing unit 301 of the data server device 300 gives 
10 this data to the data management unit 302 which carries out 
a processing for registering this data into the storage 
device 303, while also giving this data to a data type 
detection module 201 of the meta-data registration device 
200. 

15 The data type detection module 201 checks a data type 

of the data received from the access request processing 
unit 301. 

A meta-data generation procedure selection module 202 
of the meta-data registration device 200 is managing a 

20 correspondence between various data types and meta-data 
generation procedures (programs) for data of respective 
data types, in a form of a table, for example. The meta- 
data generation procedure selection module 202 receives the 
data type detected by the data type detection module 201, 

25 refers to the above described correspondence table , and 

selects the meta-data generation procedure corresponding to 
that data type. 

A meta-data generation module 203 of the meta-data 
registration device 200 reads a program corresponding to 

30 the meta-data generation procedure selected by the meta- 
data generation procedure selection module 202, and 
generates a meta-data corresponding to the data given from 
the access request processing unit 301 by using this 
program . 

35 A meta-data registration module 204 of the meta-data 
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registration device 200 registers the meta-data generated 
by the meta-data generation module 203 in relation to the 
corresponding data, by commanding the data management unit 
302 accordingly, 
5 Note that each unit or module described above can be 

realized in a form of hardware, software, or a combination 
of software and hardware. It is also possible to realize 
the data server device or the meta-data registration device 
of the present invention by installing a program for 
10 realizing functions of each unit or module described above 
into a general purpose computer. 

Now, an exemplary case of applying the present 
invention to a Web server compatible with WebDAV will be 
; s j described. In this example, the data type is detected from 

UT 15 a MIME type or an extension of a data name of the data to 

be registered into the Web server. 
Lnl The Web server of this example has a meta-data 

?Z generation procedure table (data type/procedure 

« correspondence table) as shown in Fig. 7. This meta-data 

f7 ; 20 generation procedure table is a table indicating a 
| a y correspondence among three pieces of information including 

[J£ a MIME type, an extension of data corresponding to that 

□ MIME type, and a meta-data generation procedure 

corresponding to that MIME type. For example, this table 
25 indicates that data in the type of "image/ jpeg" has the 
extension of either "jpeg" or "jpg", and its meta-data 
generation procedure is specified by "metajpeg" . Here, the 
meta-data generation procedure name is actually given by 
either a pointer to a function or a name of a program 
30 module to be executed. 

Instead of providing such a meta-data generation 
procedure table, it is also possible to embed these 
correspondences into a meta-data registration program (a 
program for causing the general purpose computer to 
35 virtually operate as a meta-data registration device having 
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a configuration as shown in Fig. 5) as conditional 
branchings. 

When the Web server of this example receives a PUT 
request which is a data registration request, that data is 
5 registered under the specified name, while the meta-data 
registration processing as shown in Fig. 8 is started. 

In the meta-data registration processing of Fig. 8, 
whether the MIME type of the data to be registered 
(registering data) is specified in the header of the PUT 

10 request or not is checked (step SI) . 

If the MIME type is specified, the meta-data 
generation procedure table is referred using a name of the 
specified MIME type as a key, and the meta-data generation 
procedure for the data in that MIME type is taken out (step 

15 S2) . If the MIME type is not specified, an extension 

portion of a name of the registering data is taken out, the 
meta-data generation procedure table is referred using that 
extension as a key, and the meta-data generation procedure 
for the data having that extension is taken out (step S3). 

20 Here, when the MIME type of the registering data does not 
exist in the meta-data generation procedure table at the 
step S2, or when the extension of the registering data does 
not exist in the meta-data generation procedure table at 
the step S3, it indicates that the meta-data generation 

25 procedure for this data is not provided (step S4 NO), so 
that the processing is terminated without attaching any 
meta-data to the registering data. On the other hand, when 
the corresponding meta-data generation procedure is found 
(step S4 YES), this procedure is executed to generate the 

30 meta-data to be attached to the registering data (step S5) . 
The meta-data generated here is a collection of 
arbitrary numbers of pairs of meta-data name and value as 
shown in Fig, 9. In this example, the meta-data with a name 
"data" has a value "1998-07-12", the meta-data with a name 

35 "author" has a value "Y. Suzuki", and the meta-data with a 
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name "program" has a value "photoshop4 . 0" . After generating 
the meta-data in this manner, the meta-data is registered 
in relation to the registering data (step S6). 

In this example, the processing is immediately 
5 terminated at the step S4 when the meta-data generation 

procedure corresponding to the data type of the registering 
data is not found, but this part of the processing may be 
modified as follows. Namely, it is possible to generate a 
meta-data indicating that the meta-data generation 
10 procedure for that data was not found at the step S5 , and 
register this meta-data in relation to the registering data 
at the step S6. Alternatively, it is also possible to 
notify the user who is registering the data that the meta- 
data generation procedure corresponding to the registering 
U1 15 data cannot be found so that the automatic generation of 
meta-data is not carried out. By this notification, the 

j Ls 

j :T user can recognize the fact that the registering data has 

Id 

{m no meta-data attached so that this registering data cannot 

» be searched by the subsequent retrieval using the meta- 

.ass, 

j 1 20 data. This is also effective in warning the user when a 
i'Ll wrong data is registered or a wrong data registration 

method is used. 

The above described example employs both a method for 
detecting the data type according to the MIME type 
25 specified in the header of the PUT request and a method for 
detecting the data type according to an extension of a name 
of the registering data, with the higher priority given to 
the former method, but it is also possible to employ both 
methods with the higher priority given to the latter 
30 method, or either one of these methods alone may be 

employed. Also, after the step S5 executes the meta-data 
generation procedure selected according to the MIME type at 
the step S2 once, if it is found out that the meta-data 
generation was not carried out appropriately, it is 
35 possible to select another meta-data generation procedure 
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according to the extension once again, and execute this 
procedure to generate new meta-data once again. 

Also, as a method for detecting the data type, it is 
possible to employ a method which checks first several 
5 bytes of the data and judges the data type according to a 
value of the checked bytes, or a method which checks 
contents of data rather than the first several bytes and 
judges the data type according to the checked contents. For 
instance, in the case of data such as MPEG data, it is 

10 possible to judge that this data is MPEG data by checking 
whether the data contents include a data sequence in a 
particular pattern or not. 

Also, in the above described example, the meta-data 
generation procedure corresponding to the data type of the 

15 registering data produces a list of meta-data to be 
attached in a predetermined format as shown in Fig. 9 
first, and gives this list to the common processing step 
(step S6 of Fig. 8) which attaches the meta-data to the 
registering data, but it is also possible to implement the 

20 present invention such that the meta-data is attached to 
the registering data directly by the meta-data generation 
procedure . 

The meta-data generation procedures (programs) should 
be created for the respective data types in advance and 

25 provided in a computer (the data server device or the meta- 
data registration device). For example, a document data of 
an application such as word processor contains within 
itself information on a document creation date, a document 
title, a document creator name, etc., in a format suitable 

30 for that application. Consequently, the meta-data 

generation procedure can be created such that it extracts 
such an information contained in the document data itself 
as a meta-data in accordance with a format of the document 
data that is analyzed in advance. The similar meta-data 

35 generation procedure can be created also for a data format 
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such as PDF that is widely used in distribution of 
electronic documents. 

As disclosed in "Digital Still Camera Image File 
Format Standard (Exif) Version 2.1", JEIDA-49-1998 , Japan 
5 Electronic Industry Development Association, data of a 

digital camera based on the widely used EXIF specification 
internally contains various information such as a model of 
camera used for image taking, date and time of image 
taking, a person who took the image, an image title, a 
10 shutter speed, an aperture, a lens focal length, latitude, 
longitude and altitude of an image taking location, user 
comments, etc. The EXIF specification based data has the 
MIME type of "image/jpeg" or "image/tiff", and the meta- 

,z ss. 

. data generation procedure corresponding to these data types 

w 

=H 15 is implemented such that it checks whether or not the EXIF 

! i ti 

;=fS information region is contained in a file of the JPEG or 

UT TIFF format, and extracts the EXIF information as the meta- 

;2 data if the EXIF information region is contained. Even when 

i5 the EXIF information region is not contained, it is still 

^ 20 possible to extract some meta-data which is specific to the 
j'LS JPEG or TIFF format such as resolution. 

; s if As disclosed in "Recording Helical-scan digital 

□ video cassette recording system using 6,35 mm magnetic tape 

for consumer use (525-60, 625-50, 1125-60 and 1250-50 

25 systems) --" (IEC 61834), data of a digital video based on 
the widely used DV specification internally contains 
various information such as a recording date and time, a 
category indicating a recorded content, language, a channel 
in the case of recording a broadcast program, latitude and 

30 longitude of a recording location, various settings and 

device data of a camera, etc., within the video stream. The 
DV specification based data is often recorded as a file in 
the AVI format or the QuickTime format that are widely used 
on PCs, and has the MIME type of "video/x-msvideo" or 

35 "video/quicktime" . The meta-data generation procedure 
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corresponding to these data types is implemented such that 
it checks whether or not the DV data are contained in a 
file of the AVI or QuickTime format, and extracts the 
various DV data as the meta-data if the DV data are 
5 contained. 

It is also possible for the meta-data generation 
procedure to acquire some information not contained in the 
registering data from the external and attach it as the 
meta-data, in addition to simply extracting information 

10 contained in the registering data as the meta-data, as 

indicated by a dashed line with arrowhead labelled "other 
information" that is entered into the meta-data generation 
module 203 in Fig. 5. For example, in the case of the DV 
specification based video data, it is possible to acquire 

15 information regarding the program from an electronic 
program guide (EPG) according to information on the 
recording time and the broadcast channel contained within 
the data, and register information on the program title, 
the cast, etc., that is contained in the acquired 

20 information as the meta-data. As for a method for 

extracting information from the electronic program guide, 
the electronic program guide information that is 
transmitted by utilizing a part of the broadcast signals 
can be stored in this data server device or a computer 

25 connected to this data server device, and information on 
the pertinent program can be extracted from there. 
Alternatively, it is also possible to extract the 
electronic program guide information on the pertinent 
program from another server that is registered in advance, 

30 via communication lines such as the Internet. 

The types and the number of the meta-data to be 
generated by the meta-data generation procedure can be 
different for different data types. However, there are many 
meta-data which have the same meaning regardless of the 

35 data types, such as date, location, title, and data 
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creator, for example. For these meta-data, it is preferable 
to create each meta-data generation procedure such that 
information extracted from data of different types will be 
registered as the meta-data of the same name. Namely, the 
5 meta-data generation procedure corresponding to the 
document data and the meta-data generation procedure 
corresponding to the JPEG data should preferably be created 
such that both the creation date extracted from the 
document data and the imaging date extracted from the EXIF 
10 information of the JPEG data will be registered under the 
meta-data name of "date". 

By such an implementation, it becomes possible to 
carry out meaningful retrieval even when data of different 
types coexist. For example, when the digital camera data in 
Lfij 15 the EXIF format, the digital video data in the DV format, 
J:* and the schedule data, all of which were created on the 

Lrl same date, are coexisting, it is possible to select the 

; : S schedule of that date and the camera images and the video 

a taken on that date by the retrieval specifying that date. 

! : ~! 20 Besides that, it also becomes possible to carry out the 
| e y retrieval for searching data regarding a certain location 

regardless of whether it is camera image or video. 
□ In the case where the data type of the registering 

data is the XML document, it is also possible to change a 
25 manner of generating the meta-data according to a content 
of that XML document . 

As shown in Fig. 10, the XML document has a tree 
structure with hierarchical tags called elements. Namely, 
what is enclosed between <book> and </book> is an element 
30 with an element name "book" , and this element contains 

plural elements such as an element enclosed between <title> 
and </title>. To the XML document, it is possible to 
associate a document definition information called DTD. In 
the example of Fig. 10, the DTD with a name "book.dtd" is 
35 associated. In the DTD, what elements appears in what order 
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or hierarchy in that document is defined. In the case of 
"book.dtd", it is defined that elements "title", "author", 
"publisher", etc., are arranged within a top level element 
"book". The XML document does not always have the DTD, and 
5 there can be XML documents without DTD. In such a case, 

there is a total freedom as to what elements are to be used 
in what order or hierarchy. As a method for defining the 
structure of the XML document, there are various available 
methods other than DTD, such as DDML , DCD , SOX, XML-DATA, 
10 and RDF, and any of these methods can be handled similarly 
in the meta-data registration device of this embodiment. 

In the Web server of this example, when the data 
registration request using the PUT request is received, the 
S meta-data generation procedure selection module 202 refers 

"j 5t? 

^ 15 to the data type/procedure correspondence table and judges 
i=y that it is the XML document if the specified MIME type is 

LH- "text/xml" or the extension is "xml". Then, the meta-data 

i% generation module 203 selectively executes the meta-data 

generation procedure corresponding to the data type of the 
■ 3S *j 20 XML document, where this meta-data generation procedure for 
ry the XML document refers to an XML meta-data correspondence 

! ; ~ table (not shown) to check whether there exists an element 

O specified by the XML meta-data correspondence table in the 

registering XML document or not, and extracts information 
25 contained in that element if that element exists. 

The XML meta-data correspondence table can be given by 
a table comprising three data items including a DTD name, 
an element name and a corresponding meta-data name, as 
shown in Fig. 11, for example. In the case of the XML 
30 document shown in Fig. 10, the DTD name is "book.dtd", and 
when the XML meta-data correspondence table of Fig. 11 is 
referred using this DTD name, it is indicated therein that 
information of the element under the name "title" should be 
set as a value of the meta-data under the name "title", 
35 information of the element under the name "author" should 
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be set as a value of the meta-data under the name "author" , 
and information of the element under the name "keywords" 
should be set as a value of the meta-data under the name 
"keyword". As a result, the meta-data as shown in Fig. 12 
5 can be generated. 

The types and/or the number of meta-data generated by 
the meta-data generation procedure for the XML document can 
be different for different XML documents to be registered. 
However, for the meta-data which have the same meaning but 

10 different element names in different XML documents, such as 
date, location, title, and data creator, for example, it is 
preferable to create the XML meta-data correspondence table 
such that information extracted from elements with 
different element names will be registered as the meta-data 

15 of the same name. Namely, the example of Fig. 11 is made 
such that both the "creator" information extracted from 
"photo. dtd" and the "author" information extracted from 
"book. dtd" will be registered under the meta-data name of 
"author". By this implementation, it is possible to carry 

20 out the meaningful retrieval even when XML documents with 
different DTDs are coexisting. 

In the above described example, the information to be 
extracted as the meta-data is specified by the DTD name and 
the element name in the XML meta-data correspondence table, 

25 but it is also possible to specify the information to be 
extracted as the meta-data by the element name alone. In 
such a case, whenever the element with the specified name 
exists, the corresponding meta-data is extracted regardless 
of what XML document it is. This applies similarly for 

30 those XML documents without DTD. 

It is also possible to specify an element from which 
the information should be extracted as the meta-data by a 
pair of a name space name and an element name. Here, the 
name space name is assigned in order to distinguish cases 

35 in which the same element name has different meanings 
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depending on the DTDs in which it is defined. 

In addition, instead of registering the information of 
the element specified in the XML meta-data correspondence 
table directly as a value of the meta-data, it is also 
5 possible to register the meta-data after removing 

unnecessary tags (such as tags of other elements that are 
contained in the specified element) from that information, 
summarizing the content of that information, or converting 
date or the like into some standard format, 

10 There are also cases where "application/xml" is used 

as the MIME type of the XML document, but these cases can 
be handled similarly as described above. 

The other structured documents such as HTML documents 
and SGML documents can also be handled similarly as the XML 

15 documents. 

A timing for generating and registering the meta-data 
using the meta-data registration device of this embodiment 
can be a timing for receiving and processing the data 
registration request at the data server device such as Web 

20 server, database server or file server, or the meta-data of 
the newly registered data can be generated and registered 
asynchronously after the data registration. 

The meta-data registration device of this embodiment 
is equally applicable to a file system capable of managing 

25 the meta-data. In the case of the Web server, all the data 
will be registered collectively in response to the PUT 
request, but in the case of the file system, it is 
preferable to handle cases where a part of the already 
existing data is to be updated. For this reason, it is 

30 preferable to carry out the meta-data generation and 

registration processing similar to that in the case of the 
Web server, at a time of closing files. It is also possible 
to record which file is updated at a time of closing files, 
and carry out the meta-data generation and registration 

35 processing asynchronously later on by searching out the 
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updated files. 

In the meta-data registration device of this 
embodiment, a suitable meta-data generation procedure is 
selected according to the data type of the registering 
5 data. As a method for judging the data type, apart from 
those already described above, it is also possible to 
employ a method which handles data entered from a specific 
input device as a prescribed type. For example, it is 
possible to assume that video data entered from the digital 

10 video device via the IEEE 1394 network are always in the DV 
format, and carry out the procedure for extracting the 
meta-data from data of the DV format type with respect to 
such video data. 

In the case of a system which can attach the meta-data 

15 not only to data but also to a directory, it is preferable 
to provide a function for updating the meta-data of a 
directory into which the data is to be registered, at a 
time of the data registration, in addition to the 
generation of the meta-data for that data. Namely, data or 

20 child directories contained in that directory may be 

changed by the new registration or deletion of data. For 
this reason, in addition to attaching the meta-data at a 
time of registration as in the case of the ordinary data, 
the need to change the meta-data of the directory is 

25 checked whenever data or child directory contained in the 
directory is added/deleted, and the meta-data of the 
directory is updated whenever necessary. 

The information to be updated as the meta-data of that 
directory at a time of addition/deletion of data or child 

30 directory within that directory includes a range of dates 
of data and meta-data, a range of locations of data and 
meta-data, and keywords for retrieval of data and meta- 
data. Namely, the meta-data of the directory contains 
information indicating common features of the meta-data of 

35 data or child directories within that directory. 
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The meta-data generation procedure required in 
attaching the meta-data to the directory is different from 
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the meta-data generation procedure for each data type in 
that it has to be managed in correspondence to the 
directory. As a method for maintaining this correspondence, 
there is a method which manages a pair of the directory 
name and the meta-data generation procedure for that 
directory in a correspondence table. As another method, 
there is also a method in which information on the meta- 
data generation procedure for that directory is included as 
a data under a specific name that is contained within that 
directory. As still another method, there is also a method 
in which a value of the meta-data with a specific name of 
that directory indicates information on the meta-data 
generation procedure of that directory or information for 
obtaining the meta-data generation procedure of that 
directory. Any of these methods may be used singly or in 
combination, it is also possible to check whether each 
method among a plurality of methods is applicable or not in 
a prescribed order, and use a method that is checked to be 
applicable first. 

An exemplary operation in the case of employing the 
first method for maintaining the correspondence described 
above is as follows. When the addition/deletion of data or 
child directory occurs, the access request processing unit 
301 or the data management unit 302 notifies this fact to 
the meta-data generation procedure selection module 202. 
Then, the meta-data generation procedure selection module 
202 selects the meta-data generation procedure (program) 
corresponding to the directory that contains the data or 
child directory to be added/deleted, by referring to a 
directory name/procedure correspondence table (not shown) . 
The meta-data generation module 203 then reads and executes 
the selected procedure. For example, the meta-data of that 
directory is read out from the storage device 303 through 
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the data management unit 302 and its content is checked, 
while the meta-data of the data or child directories 
contained in that directory (at least one of the data or 
child directory to be added/deleted and the others) are 
5 read out and their contents are checked, and in the case 
where the common features of the meta-data of the data or 
child directories contained in that directory will be 
changed by that addition/deletion, new meta-data for that 
directory is generated. Then, the meta-data registration 
10 module 204 rewrites the meta-data of that directory into 
the newly generated content. 

Besides that, it is also possible to change a manner 
of generating the meta-data at a time of generating the 
=1 meta-data of data to be registered by the meta-data 

a is? 
■ r» 

y\ 15 generation procedure, by referring to the meta-data of the 
j?Q directory into which this data is to be registered, as 

y\ indicated by a dashed line with arrowhead labelled "other 

i"f3 information" that is entered into the meta-data generation 

module 203 in Fig. 5. 
j75 20 For example, whether the directory has the meta-data 

TU of the same name as the meta-data generated from the 

!=5 registering data or not is checked. If it has, a value of 

□ the meta-data of the registering data and a value of the 

meta-data of the directory are compared, and if they differ 
25 (they do not coincide or a value of the meta-data of the 
registering data is not within a range of a value of the 
meta-data of the directory), this fact is notified to the 
data registration requesting source. By this 
implementation, it is possible to prevent the erroneous 
30 registration of data with different dates into a directory 
in which only data of the same date should be entered, for 
example. 

As another example, whether the directory has the 
meta-data of the same name as the meta-data generated from 
35 the registering data or not is checked. If it has, a value 
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of the meta-data of the registering data and a value of the 
meta-data of the directory are compared, and if they 
coincide, this meta-data is not generated for the 
registering data. In this way, when there is some data 
which does not have the meta-data of interest, a value of 
the meta-data with the same name of the directory in which 
that data exists can be used as a default value (or when 
data having a certain meta-data value is to be searched but 
there is some data which does not have that meta-data, the 
search can be carried out by regarding a value of that 
meta-data of the directory in which that data exists as a 
value of that meta-data of that data). As a result, in the 
case of the directory into which a plurality of data with 
the same date are contained, for example, the meta-data for 
the date is attached only to the directory and not to 
individual data, so that it is possible to save the storage 
capacity. Also, in this method, when most of the data in 
the directory use the default date as the meta-data, it is 
still possible to allow the entry of data having an 
exceptional value for the meta-data for the date into that 
directory (in which case, at a time of the search, the 
default date is used for those data which have no meta-data 
for the date and the date indicated by the meta-data for 
the date is used for those data which have the meta-data 
for the date) . 

Also, as other embodiments of the present invention, 
it is possible to implement a part of the functions of the 
meta-data registration device separately, in the external 
of the data server, rather than incorporating the entire 
meta-data registration device as a part of the data server 
device such as Web server or file system. 

Fig. 13 shows a configuration in which all the 
functions related to the meta-data generation among the 
functions of the meta-data registration device are 
implemented separately in the external of the data server 
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device. In this configuration, a function (201) for 
detecting the data type of the registering data, a function 
(202) for selecting the meta-data generation procedure 
corresponding to that data type, a function (203) for 
5 generating the meta-data using the selected procedure, and 
a function for returning the obtained result to the data 
server device 400 in a format shown in Fig. 9, for example, 
are provided in a meta-data extraction unit 402 which is 
Independent from the data server device 400. A meta-data 

10 generation calling unit 401 provided in the data server 

device 400 gives the data to the meta-data extraction unit 
402 and receives the meta-data generated as a result, and a 
meta-data registration unit 403 provided in the data server 
device 400 registers the received meta-data into the 

15 storage device 103 through the data management unit 102. 
Communications between the data server device 400 and the 
meta-data/^ extraction unit 402 are realized by inter-process 
communications or communications via network. 

Fig. 14 shows a configuration in which the function 

20 related to the meta-data generation with respect to a 

specific data type among the functions of the meta-data 
registration device are implemented separately in the 
external of the data server device. In this configuration, 
a function (503) for generating the meta-data for the data 

25 in the specific data type given from the data server device 
500 and returning the obtained result to the data server 
500 in a format shown in Fig. 9, for example, is provided 
in a meta-data extraction unit 502 which is independent 
from the data server device 500. A meta-data generation 

30 calling module 501 provided in the data server device 500 
gives the data to the meta-data extraction unit 502 when 
the generation procedure selected by the meta-data 
generation procedure selection module 202 should be 
executed outside the data server device 500, and receives 

35 the meta-data generated as a result. There may be a 
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plurality of meta-data extraction units 502 in 
correspondence to a plurality of data types. In such a 
case, the meta-data generation calling module 501 gives the 
data to one meta-data extraction unit 502 that has the 
5 procedure specified from the meta-data generation procedure 
selection module 202 among a plurality of meta-data 
extraction units 502. 

In the case of adopting a configuration as shown in 
Fig. 14, information necessary in calling the external 
10 function for extracting the meta-data for the data type is 
recorded in an entry of the meta-data generation procedure 
for that data type in the meta-data generation procedure 
table to be referred by the meta-data generation procedure 
selection module 202. For example, in the case where the 
15 data server device 500 and the meta-data extraction unit 
502 communicate using SOCKET which is the inter-process 
communication function of the UNIX, an address of SOCKET is 
recorded. It is also possible to use both the configuration 
of Fig. 14 and the configuration of Fig. 6 in combination, 
j 20 As described, according to the present invention, it 

is possible to attach the meta-data to the registering data 
automatically, so that it is possible to relieve the user 
3 from the tedious task of attaching the meta-data to each 

individual registering data, and the necessary meta-data 
25 can be attached without failure. 

In addition, the meta-data under the same names can 
always be attached to data of the same data type, and even 
for data of the different data types, the same name can be 
used for the meta-data which have the same meaning at a 
30 level of the meta-data generation procedure, so that it is 
possible to carry out the retrieval specifying the same 
meta-data name with respect to a wide range of data. 

It is to be noted that the above described embodiments 
35 according to the present invention may be conveniently 
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implemented using a conventional general purpose digital 
computer programmed according to the teachings of the 
present specification, as will be apparent to those skilled 
in the computer art. Appropriate software coding can 
5 readily be prepared by skilled programmers based on the 

teachings of the present disclosure, as will be apparent to 
those skilled in the software art. 

In particular, the meta-data registration device of 
the data server device of the above described embodiments 
10 can be conveniently implemented in a form of a software 
package . 

Such a software package can be a computer program 
product which employs a storage medium including stored 
computer code which is used to program a computer to 

15 perform the disclosed function and process of the present 
invention. The storage medium may include, but is not 
limited to, any type of conventional floppy disks, optical 
disks, CD-ROMs, magneto-optical disks, ROMs, RAMs , EPROMs , 
EEPROMs , magnetic or optical cards, or any other suitable 

20 media for storing electronic instructions. 

It is also to be noted that, besides those already 
mentioned above, many modifications and variations of the 
above embodiments may be made without departing from the 
novel and advantageous features of the present invention. 

25 Accordingly, all such modifications and variations are 

intended to be included within the scope of the appended 
claims. 

30 
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